SYSTEM AND METHOD OF DECODING A 
PACKED REPRESENTATION OF MULTIPLE PARSES 



FIELD OF THE INVENTION 

The present invention generally relates to natural language parsing, 
and more particularly to natural language parsing that is suitable for automatic 
speech recognition. 

BACKGROUND OF THE INVENTION 

In general, a semantic representation is generated from a 
natural-language utterance by the following process. The natural-language 
utterance is processed by a speech recognizer, which is coupled to a lexicon 
containing a plurality of words of the natural language. The speech 
recognizer outputs a word graph, containing at least one, and normally 
multiple words from the lexicon that is hypothesized to correspond to the 
natural-language utterance. The word graph is input into a parser, which is 
coupled to a grammar for the natural language. Using this grammar, the 
parser constructs a parse forest during the parse. 

The parse forest is then inputted into a semantic interpreter, which is 
coupled to a knowledge base. The semantic interpreter processes the parse 
forest according to a predetermined semantics and outputs at least one 
semantic representation corresponding to the natural language utterance. 

One problem with this process is that multiple word hypotheses in the 
word graph combined with ambiguities in the grammar may yield a large 
number of parses. Thus, the parse forest may become unmanageably 
complex. Although techniques exist to ameliorate this problem, they may 
remove structures that may later turn out to be. useful. Alternatively, if the 
parse forest is packed during the parsing process, the problem of its 
complexity is only deferred, since an accurate and efficient means is still 
required to unpack the forest for processing by the semantic interpreter. 

Primarily, it would be desirable to provide a method of decoding or 
unpacking the individual parses stored in the packed representation of a 



parsed word graph to yield a forest of parse trees, preferably in an order that 
represents their probable correctness or usefulness according to a 
predetermined metric. 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a schematic representation of a speech recognition system 
including a packed representation decoding module in accordance with the 
invention; 

FIG. 2 is a flow chart of an embodiment of a top-level unpacking 
process in accordance with the invention; 

FIG. 3A is a flowchart of one embodiment of the "get next edgenode" 
process in accordance with the invention; 

FIG. 3B is a schematic representation of the states of the 
self-embedding operation as defined for forest objects; 

FIG. 3C is a schematic representation of the states of the N-fold 
replication operation as defined for forest objects; 

FIG. 4 is a schematic representation of a packed forest; 

FIG. 5 is a schematic representation of the packed forest of FIG. 4 with 
most probable and alternative parses; 

FIG. 6 is a schematic representation of the unpacking process of the 
packed forest of FIG. 4; 

FIG. 7 is a schematic representation of an unpacked forest resulting 
from unpacking the packed forest of FIG. 4; 

FIG. 8 is a schematic representation of a word graph output from a 
speech recognizer; and 

^^~FIG. 9 is a IjIML diagram of a preferred embodiment for the forest and 
its related classes. \ 
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DETAILED DESCRIPTION OF THE 
PRESENTLY PREFERRED EMBODIMENTS 

deferring to FIG. 1 , a schematic diagram of a system 1 0 for encoding 

/ and decochng multiple parses is shown. The speech recognizer (SR) 11 

5 produces outout or word graph 12, which is sent to parser 14. A grammar 16 

is used to produce a packed representation 18 of the multiple parses. A 

preferred methocmnd system for encoding the packed representation 18 is 

disclosed in U.S. Application No. entitled "Method, Device 

and System For Generalized Bidirectional Island-Driven Chart Parsing," the 

10 entire disclosure of whicKjs incorporated herein by reference. The packed 

O representation 18 is then unpacked using the unpacking module 20 to create 

m an unpacked forest 22 which isvjnput to a semantic interpretation module 24. 

Jj; A flow chart of a preferred emboahqrient of the unpacking module 20 is 

W presented in FIGS. 2 and 3A. The uhpacking module 20 may include any 

5f> suitable computer programming code using any conventional programming 

!\ language, and may be stored on any conventional computer readable 

p medium. 

When the parser 14 is run in multiple-parse mode, the resulting parses 

j={ are generated and stored in a packed representation 25, a simplified example 

20 of which is shown in FIG. 4. The packed representation 25 encodes: (1) a set 

of grammatical paths through the word graph sent to the parser 14 by the SR 

11, and (2) the corresponding syntactic structures assigned to these paths by 

the parser 14 (the syntactic structure assigned to a path allows it to be 

analyzed semantically by the semantic interpretation module, which operates 

25 on the parse trees constructed by the parser). The subgraph of the SR word 

graph that contains the set of grammatical paths will be referred to as the 

"active word graph." 

A simplified representation of a SR word graph 28 corresponding to the 

packed representation in FIG. 4 is shown in FIG. 8; the heavy lines 26 

30 represent the "best path" through the word graph 28 (corresponding to the 

terminals of the most probable parse tree), while the heavy lines 26 together 

with the solid lines 27 represent the active word graph and the dashed lines 



29 represent paths not selected by any parse. It is important to note that the 
active word graph may contain paths that are not selected with respect to the 
parses stored in the packed representation 25 (i.e., they do not have valid 
syntactic structures and thus cannot be analyzed semantically); in the 
example shown in FIG. 5, it can be seen that the path (wO, w1, w4, w6) does 
not correspond to a valid parse, even though it occurs in the active word 
graph. 

ie parser's packed representation 25 (see FIG. 5) consists of a 
hierarchically ordered set of edgenodes corresponding to Complete edges 
from the parser's chart. The "most probable" parse tree that is returned by the 
parser 14 (ana\which provides the basis for further processing) is constructed 
by extracting frorn the packed representation 25, the information necessary to 
construct a treenode corresponding to the most probable edgenode at each 
node of the parse tre^. The edge class is used by the parser for encoding the 
course of the parse; consequently, edge objects are relatively large and have 
complex behavior. Although encoding a parse tree by use of edge objects 
has no apparent impact on parser performance when it returns only one parse 
tree per input word graph, returning all possible parses in this manner may 
have an adverse impact on systern performance. A solution to this problem is 
to use the C++ inheritance mechanfern to split the data and behavior of the 
edge class into: (1) a base class ("treehode"), which contains only the data 
and behavior that is necessary to define and process parse trees returned by 
the parser, and (2) a derived class (treenoa^: edge), which contains all of the 
remaining data and behavior (required for constructing and maintaining the 
parses in the parser's agenda and chart objectsV In the discussion that 
follows, the base class "treenode" will be referred tb as a treenode, while the 
derived class "edge" will be referred to as an edgenc^e. 

Referring to FIG- 4, the packed representation 25 consists of a subset 
of edgenodes corresponding to the (complete) edges in the parser's chart. 
Each edgenode 30 has associated with it a substitution list 32, membership in 
which is shown in the packed representation schematic in FIG. 4 by inclusion 
in a rectangle 32, with ordering in the substitution list indicated by an arrow 31 



(e.g., edgenodes CO (32C0) and C1 (32C1) are in the same substitution list 
32; note that the numeric indices are used for expository purposes only, as 
both edges 32C0 and 32C1 represent expansions of the nonterminal symbol 
"C," which can be seen by reference to the active ruleset 40 (upper right-hand 
corner of FIG. 5). 

Substitution lists 32 have the following properties: (1) each edgenode 
in the list has the same label (i.e., all edgenodes were generated by clauses 
of the same BNF (Backus-Naur Form) rule, though they may correspond to 
different phrase-structure rules); (2) each edgenode in the list corresponds to 
an edge with the same starting and ending vertices in the parser's chart 
(though the paths connecting these vertices may be different); and (3) the 
edgenodes in the list are ordered according to score, with the "best" 
edgenode (according to the scoring metric) being the first element of the 
substitution list. Given these properties, a traversal of the packed 
representation 25 that expands the first edgenode in each substitution list will 
yield the tree that corresponds to the most probable parse (lower left-hand 
corner of FIG. 5), the terminals of which define the best path through the 
active word graph 28 (heavy line in FIG- 8). This algorithm is used to obtain 
the best parse whether or not the parser is running in multiple-parse mode, 
since it is possible (and, in fact, likely) that the parser 14 will obtain multiple 
structures over subspans of the word graph 28 that will be incorporated into 
the first successful parse 14 of the word graph 28 (note that, when run in 
single-parse mode, the substitution list of the topmost edgenode in the packed 
representation returned by the parser 14 is guaranteed to contain only one 
edgenode). * 

An unordered set of alternative parses can be obtained from a given 
packed representation 25 by an exhaustive traversal of the representation 
which incorporates the processing of the substitution lists as described below. 
If {E n } is the set of n ecraenodes that define the most probable parse obtained 
from the packed representation, and ki is the number of edgenodes in the 
substitution list corresponding to the edgenode E„ then the number of parse 
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n 

trees returned by ft^e traversal will be less than or equal to jj k . The 

* ^ i 

number of parse treesViay be less than this number due to dependencies 
resulting from the dominance relationships in the tree; (e.g. in the example 
shown in FIG. 5, there are only three parses (rather than four) because the 
substitution list containing edgenodes DO and D1 is dominated by only one of 
the edgenodes in the substitution list containing edgenodes CO and C1 , not 
both of them). 

As discussed above, the most probable parse for a given sentence is 
obtained from the packed representation 25 generated by the parser 14 
W (after it has parsed the sentence) by choosing the first element from each 

%l substitution list 32 during a traversal of the packed representation 25. An 

[Jf ordered set of parses may therefore be obtained from the packed 

o representation 25 by generalizing the traversal algorithm to operate on 

"forests" of trees as follows: (1) each time a substitution list 32 containing 
Jt5 more than one edgenode (e.g., 32C0, 32C1) is encountered during the 

Q traversal, replicate the partial trees in the forest currently under construction 

Ji: as many times as there are edgenodes in the substitution list 32; (2) update 

Q each copy of the forest with a treenode corresponding to a different edgenode 

from the substitution list 32; and (3) continue the traversal through each copy 
20 of the forest according to the order of elements in the substitution list 32 

(thus completing the processing of all of the edgenodes below a particular 
edgenode in the list before moving on to the next edgenode in the list). The 
unpacking module 20 is an object-oriented implementation based on this 
algorithm. 

25 Note that this algorithm ensures only that all of the parse trees 

encoded in the packed representation 25 are obtained, not that they are 
ordered according to probability (although the best parse, as noted above, is 
guaranteed to be first). This is because the algorithm generates all of the 
subtrees corresponding to the expansion of a given edgenode in a 

30 substitution list 32 before it generates any of the subtrees corresponding to 



the subsequent edgenodes in the substitution list 32. In the example shown 
in FIG. 5, all of the subtrees containing a treenode labeled CO will be 
generated before any of the subtrees that contain a treenode labeled C1 . 
While there is strict ordering on the first subtree in each of these sets (i.e., the 
subtree containing the treenodes labeled CO and DO is more probable than 
the one containing the treenode labeled C1), such an ordering does not 
necessarily hold for any other pair of subtrees drawn from the sets (e.g., in the 
example, depending on the probabilities involved, the second-most-probable 
tree 37 containing treenodes CO and D1 may be either more or less probable 
than the most probable tree containing treenode C1 (38). This requires that 
each new tree, after it has been generated by the algorithm presented above, 
must be merged into the list of completed trees (rather than added to its end) 
based on its probability, in order to ensure that the final list is ordered 
according to the trees' probability of occurrence. 

Referring to FIGS- 3B and 3C, a forest object is represented as a box 
with two compartments. The upper compartment of the box contains the label 
of the forest object and the lower compartment of the box contains a 
specification of the state of the forest object as follows. A "A." in the lower 
compartment of the box specifies that the forest object is empty. A "U" in the 
lower compartment of the box indicates that the state of the forest object is 
unspecified (i.e., whether or not it is empty is irrelevant to the example.) If the 
forest object is neither empty nor in an unspecified state, then the lower 
compartment of the box is empty and the entire contents of the forest object is 
indicated by at least one arrow that connects it to at least one other forest 
object. If the number of forest objects contained in the set of connected forest 
objects is unspecified, the contents of the forest object are indicated by 
showing the first and last elements of the set, suitably indexed, with ellipses 
indicating the intervening forest objects in the set and ellipses indicating the 
arrows that connect these intervening forest objects to the containing forest 
object. 

Referring to FIG. 3B, the self-embed operation on a forest object 
labeled Fx is defined as: (1) relabel Fx as Fx.O, (2) construct an empty forest 
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object labeled Fx, and (3) add Fx.O to Fx. FIG. 3B presents a schematic of 
the self-embed operation showing the initial state (3B.0), the state after step 1 
(3B.1), the state after step 2 (3B.2), and the state after step 3 (3B.3), which is 
the final state. 

The N-fold replication operation on a forest object labeled Fx is defined 
as: (1) perform the self-embed operation on Fx; (2) construct N - 1 copies of 
Fx.O, labeled Fx.1 through Fx.(N-1); and (3) add Fx.1 through Fx.(N-1) to Fx. 
FIG. 3C presents a schematic of the N-fold replication operation showing the 
initial state (3C.0), the state after step 1 (3C.1), the state after step 2 (3C.2), 
and the state after step 3 (3C.3), which is the final state. 

Two other operations on forest objects are defined as follows: 

(1 ) The vector-update operation for a forest object Fx and a vector 
V of edgenodes is only possible if Fx is of cardinality N (i.e., it contains N 
embedded forests, labeled Fx.O through Fx. (N-1)) and vector V is of length N. 
When this is true, the operation proceeds by creating a treenode Ti 
corresponding to the i th element of vector V and adding Ti to forest Fi (the i th 
forest object in Fx) under its active node, thus making it the new active node 
of Fx.i. 

(2) The scalar-update operation for a forest object Fx and a 
treenode T is defined as: (1) construct N copies of T, where N is the 
cardinality of Fx, storing the copies in a set {T1 TN}, and (2) for each forest 
object Fx.i in Fx, add the treenode Ti to its active node. Note that adding a 
treenode to the active node of a forest that itself contains one or more forests 
has the effect of recursively adding this treenode to the active node of each of 
the embedded forests. 

Referring to FIG. 9, the above algorithm can be implemented by use of 
the following four classes: 

1 . The Treenode class 42 encodes a single node in a parse tree 
and can directly dominate an unspecified number of other Treenodes 42. 
Each Treenode 42 contains an unbounded collection of Treenode pointers 
which reference its directly subordinate nodes 42a and also includes a pointer 
to the Treenode 42 (if any) that directly dominates it 42b. Additionally, 



because of the synchronous construction behavior required by the Tree class, 
each Treenode 42 needs to "know" whether or not it is still under construction 
(i.e., whether it can still receive subordinate nodes). This requires special 
behavior by the Treenode copy constructor (which recursively performs a 
deep copy of all Treenodes dominated by the Treenode being copied), in 
order to ensure that this information is properly identified and transmitted. 

2. The Tree class 44 encodes a single parse tree (which is either 
complete or "partial," i.e., under construction), consisting of one or more 
Treenode objects 42. Pointers to two specific Treenode objects can be 
requested from a Tree object, its root (Tree root pointer 44a), and its "active 
node" (Active Treenode pointer 44b). The latter is NULL if the tree 44 is 
complete. If the tree 44 is partial, then the active node is the one that is 
modified during the course of tree completion. This encodes the restriction 
that a Tree object can only grow at one point during its construction (though, 
of course, this point will move around within the Tree). Because the Treenode 
copy constructor performs a recursive deep copy, the Tree copy constructor is 
accordingly simplified (i.e., to copying the root node and then ensuring that 
the active node information is properly transmitted). 

3. The Forest class 46 contains either: (1) a single Tree object 46a 
(which may be either partial or complete); or (2) an ordered set of one or more 
Forest objects 46b. In the latter case, the Forest objects 46b will be either all 
partial or all complete, thus a Forest object 46b itself may be also said to be 
either partial or complete. If the Forest 46 is partial, then it has an "active 
nodeset," which consists of the set of all active nodes of its constituent objects 
(i.e., either Trees or Forests). This nodeset may be updated by either: (1) 
adding a single Treenode to it, with the result that the Treenode is cloned and 
added to all of the objects in the Forest (at their active nodes or nodesets, 
which are updated accordingly); or (2) adding a vector of Treenodes which is 
the same length as the number of Forests in the Forest (with the result that 
the vector elements are added in pairwise fashion to the active nodes of the 
Forests). This permits the simultaneous "growing" of the Tree objects in the 
Forest. Execution of the node-closure operation on the active nodeset results 
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in the new active node for each Tree in the Forest being determined by that 
Tree's geometry, with the Forest's nodeset updated accordingly. Finally, an 
n-fold replication operation may be applied to a given Forest object, resulting 
in a new Forest object that contains n identical copies of the original Forest 
object. 

4. The Edge class 48 is the class used by the parser 14 to encode 
edge information during the course of the parse. The Edge class is 
composed of: (1) a proper subset of the data and functions of the Edge class 
currently used by the parser 14, and (2) the static "LoadEdge" function used 
to interpret the commands of the declarative packed-edge specification 
language. Every Edge 48 contains zero or one Previous Edge pointers 48a, 
zero or one Next Edge pointers 48b, and zero or more back pointers 48c. As 
shown in the UML diagram of FIG. 9, all interaction with this class is handled 
through the Forest class 46; thus the integration of the functionality described 
above into the parser 14 is straightforwardly localized to a single interface. 
Edges and Treenodes may be implemented as separate classes. 
Alternatively, they may be linked by inheritance in order to streamline the 
implementation of the parser. 

As discussed above, when the parser 14 is run in multiple-parse mode, 
the resulting parses are generated and stored in a packed representation 25, 
a simplified example of which is shown in the upper left-hand corner of FIG. 6. 
The most probable parse 36 for a given sentence is obtained from the packed 
representation 25 by choosing the first element 32a from each substitution list 
32 during a traversal of the packed representation 25. As also discussed 
above, an ordered set of parses may therefore be obtained from the packed 
representation 25 by generalizing the traversal algorithm to operate on 
"forests" of trees as follows: 

During the traversal of the packed representation: 

1 . Each time the first edgenode in a substitution list 32 containing 
N elements, N > 1 , is encountered, an N-fold replication of the current forest 
Fx is performed. 
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2. A vector update of Fx is then performed using the substitution 
list (which contains N elements). 

3. The current forest pointer is set to point at the forest Fx.O and 
the current node pointer is set to point at the next edgenode in the packed 
representation. 

The traversal then continues through each forest Fx.i in Fx according to 
the order of elements in the substitution list 32 (thus completing the 
processing of all of the edgenodes below a particular edgenode in the list 
before the processing of the next edgenode in the list is begun). 

The traversal of the packed representation therefore involves the 
updating of two pointers: (1) the current node pointer, which references the 
edgenode that is currently being processed in the packed representation, and 
(2) the current forest pointer, which references the forest object currently 
being modified. As discussed above, a forest object is modified by either 
adding a given treenode to the "active node" of the forest object or by 
applying the "node closure" operation to the forest's active node (which, 
depending on the internal structure of the forest, will result in either a new 
active node being activated in the forest or in the forest becoming complete; 
i.e., containing no active node). In the initial state of the unpacking algorithm, 
the current node pointer points at the root edgenode in the packed 
representation and the current forest pointer points at an empty forest object 
labeled FO. 

Referring to FIGS. 2 and 3A an example of a preferred embodiment of 
the unpacking process is illustrated. From the packed representation input 
(block 49) the root edgenode 30A (FIG. 4) of the packed representation 25 is 
obtained and becomes the current edgenode (block 50). An empty forest 
object labeled FO is created and becomes the current forest (block 51). 
Because the current edgenode 30A is the first edgenode in the substitution 
list 32A (block 52), N is set to the length of the substitution list for the current 
edgenode (block 53). For edgenode 30A, the length of the substitution list 
32A is equal to 1 . Because N is not greater than 1 (block 54) a scalar update 
of the current forest FO is performed with the current edgenode (block 55), 
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resulting in a treenode labeled A being added to the (empty) current forest FO, 
where it becomes the active node. 

The next edgenode is now obtained (block 59) starting at block 64 as 
shown in FIG. 3A. Because the current edgenode 30A has at least one child 
(block 66), the next edgenode is set to the leftmost child of the current 
edgenode (block 68), that is, to edgenode 30B. Because the next edgenode 
is not NULL (block 60), the current edgenode is set to the next edgenode 30B 
(block 61). 

Next, because the current edgenode 30B is the first edgenode in its 
substitution list 32B (block 52), N is set to the length of the substitution list for 
the current edgenode (block 53). For edgenode 30B, the length of the 
substitution list 32B is equal to 1 . Because N is not greater than 1 (block 54) 
a scalar update of the current forest FO is performed with the current 
edgenode (block 55), resulting in a treenode labeled B being added to the 
current forest FO, under its active node (the treenode labeled A). Because the 
edgenode 30B is a nonterminal node, its corresponding treenode labeled B is 
also a nonterminal node, thus the nonterminal treenode labeled B becomes * 
the new active node in the current forest FO. 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30B has at least one child (block 66), the next 
edgenode is set to the leftmost child of the current edgenode (block 68), that 
is, to edgenode 30W0. Because the next edgenode is not NULL (block 60), 
the current edgenode is set to the next edgenode 30W0 (block 61). 

Next, because the current edgenode 30W0 is the first edgenode in its 
substitution list 32W0 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W0, the length of the 
substitution list 32W0 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest FO is performed with the 
current edgenode (block 55), resulting in a treenode labeled wO being added 
to the current forest FO, under its active node (the treenode labeled B). 
Because the edgenode 30W0 is a terminal node, its corresponding treenode 
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labeled wO is also a terminal node, thus the treenode labeled B remains the 
active node in the current forest FO. 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30W0 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W0 (block 70). Because it is the only edgenode in the substitution list 
32W0, it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it does not have a sibling to its 
immediate right, the node closure operation is applied to the current forest FO. 
The active node of the current forest FO is the treenode labeled B. Thus the 
application of the node closure operation results in the treenode labeled A 
(which is the parent node of the treenode labeled B) becoming the new active 
node of the current forest FO (block 77). The current edgenode is then tested 
to see if it has a parent (block 82). Because the current edgenode 30W0 has 
a parent (edgenode 30B), the current edgenode becomes edgenode 30B 
(block 86). Because the current forest is FO, it remains unchanged (block 87). 

The new current edgenode 30B is now tested to see if it is the last 
edgenode in the substitution list 32B (block 70). Because it is the only 
edgenode in the substitution list, it is necessarily the last one, so it is tested to 
see if it has a sibling to its immediate right (block 76). Because it has a sibling 
to its immediate right, the next edgenode is set to this sibling, that is, to 
edgenode 30C0 (block 78). Because the next edgenode is not NULL 
(block 60), the current edgenode is set to the next edgenode 30C0 (block 61). 

Next, because the current edgenode 30C0 is the first edgenode in its 
substitution list 32C (block 52), N is set to the length of the substitution list for 
the current edgenode (block 53). For edgenode 30C0, the length of the 
substitution list 32C is equal to 2. Because N is greater than 1 (block 54) an 
N-fold replication of the current forest FO is performed (with N equal to 2). 
Referring to FIG. 6, this replication results in the partial tree 90A in the current 
forest FO being copied into two new forests 91 and 92, respectively labeled 
FO.O and F0.1 , which become the new contents of the current forest FO. Next, 
a vector update of the current forest FO is performed with the substitution list 
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32C of the current edgenode 30C0. This results in a treenode labeled CO 
(from the first element of the substitution list 32C) being added under the 
active node (the treenode labeled A) of forest FO.O (thus becoming the new 
active node of this forest), and a treenode labeled C1 (from the second 
element of the substitution list 32C) being added under the active node (the 
treenode labeled A) of forest F0.1 (thus becoming the new active node of this 
forest), as shown in the box labeled "First Replication" in the upper right-hand 
corner of FIG. 6. The current forest is now set to the first forest in FO (90), 
that is, to forest FO.O (91) (block 58). 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30C has at least one child (block 66), the next 
edgenode is set to the leftmost child of the current edgenode (block 68), that 
is, to edgenode 30D0. Because the next edgenode is not NULL (block 60), 
the current edgenode is set to the next edgenode 30D0 (block 61 ). 

Next, because the current edgenode 30D0 is the first edgenode in its 
substitution list 32D (block 52), N is set to the length of the substitution list for 
the current edgenode (block 53). For edgenode 30D0, the length of the 
substitution list 32D is equal to 2. Because N is greater than 1 (block 54) an 
N-fold replication (94) of the current forest FO.O is performed (with N equal to 
2). This replication results in the partial tree in the current forest FO.O (91) 
being copied into two new forests, respectively labeled FO.0.0 (95) and F0.0.1 
(96), which become the new contents of the current forest FO.O (91). Next, a 
vector update of the current forest FO.O (91) is performed with the substitution 
list 32D of the current edgenode 30D0. This results in a treenode labeled DO 
(from the first element of the substitution list 32D) being added under the 
active node (the treenode labeled CO) of forest FO.0.0 (thus becoming the 
new active node of this forest) and a treenode labeled D1 (from the second 
element of the substitution list 32D) being added under the active node (the 
treenode labeled CO) of forest F0.0.1 (thus becoming the new active node of 
this forest), as shown in the box 94 labeled "Second Replication" in the left 
central portion of FIG. 6. The current forest is now set to the first forest in 
FO.O, that is, to forest FO.0.0 (95) (block 58). 
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The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30 DO has at least one child (block 66), the 
next edgenode is set to the leftmost child of the current edgenode (block 68), 
that is, to edgenode 30W2. Because the next edgenode is not NULL 
(block 60), the current edgenode is set to the next edgenode 30W2 (block 61). 

>xt, because the current edgenode 30W2 is the first edgenode in its 
substitution\(ist 32W2 (block 52), N is set to the length of the substitution list 
for the currenr^dgenode (block 53). For edgenode 30W2, the length of the 
substitution list 3zW2 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest FO.0.0 is performed with the 
current edgenode (block 5^), resulting in a treenode labeled w2 being added 
to the current forest FO.0.0, uhder its active node (the treenode labeled DO). 
Because the edgenode 30W2 is asterminal node, its corresponding treenode 
labeled w2 is also a terminal node, tnbs the treenode labeled DO remains the 
active node in the current forest FO.0.0 (95)^ 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30W2 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W2 (block 70). Because it is the only edgenode in the substitution list 
32W2, it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it has a sibling to its immediate right, the 
next edgenode is set to this sibling, that is, to edgenode 30W3 (block 78). 
Because the next edgenode is not NULL (block 60), the current edgenode is 
set to the next edgenode 30W3 (block 61). 

Next, because the current edgenode 30W3 is the first edgenode in its 
substitution list 32W3 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W3, the length of the 
substitution list 32W3 is equal to 1 . Because N is not greater than 1 (block 
54) a scalar update of the current forest FO.0.0 (95) is performed with the 
current edgenode (block 55), resulting in a treenode labeled w3 being added 
to the current forest FO.0.0 under its active node (the treenode labeled DO). 
Because the edgenode 30W3 is a terminal node, its corresponding treenode 
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labeled w3 is also a terminal node, thus the treenode labeled DO remains the 
active node in the current forest FO.0.0. 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30W3 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W3 (block 70). Because it is the only edgenode in the substitution list 
32W3, it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it does not have a sibling to its 
immediate right, the node closure operation is applied to the current forest 
FO.0.0. The active node of the current forest FO.0.0 is the treenode labeled 
DO. Thus the application of the node closure operation results in the treenode 
labeled CO (which is the parent node of the treenode labeled DO) becoming 
the new active node of the current forest FO.0.0 (block 77). The current 
edgenode is then tested to see if it has a parent (block 82). Because the 
current edgenode 30W3 has a parent (edgenode 30D0), the current 
edgenode becomes edgenode 30D0 (block 86). Because the current forest is 
FO.0.0, not F0 (block 87), it is changed to the forest of the parent of the 
current edgenode (block 88), that is, to F0.0. 

The new current edgenode 30D0 is now tested to see if it is the last 
edgenode in the substitution list 32D (block 70). Because it is not the last 
edgenode in the substitution list 32D, the next edgenode is set to the next 
edgenode in the substitution list 32D (block 72), that is, to edgenode 30D1 . 
The current forest is now set to the forest of edgenode 30D1 , that is, to the 
forest labeled F0.0.1 (block 74). Because the next edgenode is not NULL 
(block 60), the current edgenode is set to the next edgenode 30D1 (block 61). 

Because the current edgenode 30D1 is not the first edgenode in the 
substitution list 32D (block 52), the next edgenode is now obtained (block 59) 
as shown in FIG. 3A. Because the current edgenode 30D1 has at least one 
child (block 66), the next edgenode is set to the leftmost child of the current 
edgenode (block 68), that is, to edgenode 30W1 . Because the next 
edgenode is not NULL (block 60), the current edgenode is set to the next 
edgenode 30W1 (block 61). 
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Next, because the current edgenode 30W1 is the first edgenode in its 
substitution list 32W1 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W1, the length of the 
substitution list 32W1 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest FO.0.1 is performed with the 
current edgenode (block 55), resulting in a treenode labeled w1 being added 
to the current forest FO.0.1, under its active node (the treenode labeled D1). 
Because the edgenode 30W1 is a terminal node, its corresponding treenode 
labeled w1 is also a terminal node, thus the treenode labeled D1 remains the 
active node in the current forest FO.0.1 . 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30W1 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W1 (block 70). Because it is the only edgenode in the substitution list 
32W1 , it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it does not have a sibling to its 
immediate right, the node closure operation is applied to the current forest 
FO.0.1 . The active node of the current forest FO.0.1 is the treenode labeled 
D1 . Thus the application of the node closure operation results in the treenode 
labeled CO (which is the parent node of the treenode labeled D1) becoming 
the new active node of the current forest FO.0.1 (block 77). The current 
edgenode is then tested to see if it has a parent (block 82). Because the 
current edgenode 30W1 has a parent (edgenode 30D1), the current 
edgenode becomes edgenode 30D1 (block 86). Because the current forest is 
FO.0.1 , not F0 (block 87), it is changed to the forest of the parent of the 
current edgenode (block 88), that is, to F0.0. 

The new current edgenode 30D1 is now tested to see if it is the last 
edgenode in the substitution list 32D (block 70). Because it is the last 
edgenode in the substitution list 32D, it is tested to see if it has a sibling to its 
immediate right (block 76). Because it has a sibling to its immediate right, the 
next edgenode is set to this sibling, that is, to edgenode 30W4 (block 78). 
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Because the next edgenode is not NULL (block 60), the current edgenode is 
set to the next edgenode 30W4 (block 61). 

Next, because the current edgenode 30W4 is the first edgenode in its 
substitution list 32W4 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W4, the length of the 
substitution list 32W4 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest FO.O is performed with the 
current edgenode (block 55), resulting in a treenode labeled w4 being added 
to the current forest FO.O. Because the current forest FO.O contains the two 
forests FO.0.0 and FO.0.1 , each of these forests is updated at its active node 
with a copy of the treenode labeled w4 (in both of these forests, the current 
active node is the treenode labeled CO). Because the edgenode 30W4 is a 
terminal node, its corresponding treenode labeled w4 is also a terminal node, 
thus the active nodes of the forests in the current forest FO.O are not changed. 

The next edgenode is now obtained (block 59) as shown in FIG- 3A. 
Because the current edgenode 30W4 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W4 (block 70). Because it is the only edgenode in the substitution list 
32W4, it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it has a sibling to its immediate right, the 
next edgenode is set to this sibling, that is, to edgenode 30W5 (block 78). 
Because the next edgenode is not NULL (block 60), the current edgenode is 
set to the next edgenode 30W5 (block 61). 

Next, because the current edgenode 30W5 is the first edgenode in its 
substitution list 32W5 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W5, the length of the 
substitution list 32W5 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest FO.O is performed with the 
current edgenode (block 55), resulting in a treenode labeled w5 being added 
to the current forest FO.O. Because the current forest FO.O contains the two 
forests FO.0.0 and FO.0.1 , each of these forests is updated at its active node 
with a copy of the treenode labeled w5 (in both of these forests, the current 
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active node is the treenode labeled CO). Because the edgenode 30W5 is a 

terminal node, its corresponding treenode labeled w5 is also a terminal node, 

thus the active nodes of the forests in the current forest FO.O are not changed. 

^^\^^ next edgenode is now obtained (block 59) as shown in FIG. 3A. 

5 s Because tnfe^urrent edgenode 30W5 does not have at least one child 

(block 66), it ist^sted to see if it is the last edgenode in the substitution list 

32W5 (block 70). Because it is the only edgenode in the substitution list 

32W5, it is necessarilysjhe last one, so it is tested to see if it has a sibling to its 

immediate right (block 76)\Because it does not have a sibling to its 

10 immediate right, the node clo^re operation is applied to the current forest 

CJ FO.O (block 77). Because the cun^nt forest FO.O contains the two forests 

ill FO.0.0 and F0.0.1 , the application of me node closure operation to it results in 

jti the node closure operation being applieato each of these forests in turn. The 

jJJ active node of the forest FO.0.0 is the treenoabslabeled CO. Thus the 

fi application of the node closure operation to the fcxest FO.0.0 results in the 

1 treenode labeled A (which is the parent node of the treenode labeled CO) 

□ becoming the new active node of the forest FO.0.0. Similarly, the active node 

of the forest F0.0.1 is the treenode labeled CO. Thus the application of the 

y node closure operation to the forest FO.O. 1 results in the treenodel^beled A 

20 (which is the parent node of the tr eenode labeled CO) becoming the_ 

— ~* 

actrverrecie or tne rorestFO.0.1 . 

Because it does not have a sibling to its immediate right, the node 

closure operation is applied to the current forest FO.O, resulting in the parent 

treenode (labeled CO) of the treenode labeled w3 becoming the new active 

25 node of the current forest FO.O (block 77). The current edgenode is then 

tested to see if it has a parent (block 82). Because the current edgenode 
30W5 has a parent (edgenode 30C0), the current edgenode becomes 
edgenode 30C0 (block 86). Because the current forest is FO.O, not FO 
(block 87), it is changed to the forest of the parent of the current edgenode 

30 (block 88), that is, to FO. 

The new current edgenode 30C0 is now tested to see if it is the last 
edgenode in the substitution list 32C (block 70). Because it is not the last 
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edgenode in the substitution list 32C, the next edgenode is set to the next 
edgenode in the substitution list 32C (block 72), that is, to edgenode 30C1. 
The current forest is now set to the forest of edgenode 30C1 , that is, to the 
forest labeled F0.1 (block 74). Because the next edgenode is not NULL 
(block 60), the current edgenode is set to the next edgenode 30C1 (block 61). 

Because the current edgenode is not the first edgenode in the 
substitution list (block 52), the next edgenode is now obtained (block 59) as 
shown in FIG. 3A. Because the current edgenode 30C1 has at least one child 
(block 66), the next edgenode is set to the leftmost child of the current 
edgenode (block 68), that is, to edgenode 30W2. Because the next 
edgenode is not NULL (block 60), the current edgenode is set to the next 
edgenode 30W2 (block 61 ). 

Next, because the current edgenode 30W2 is the first edgenode in its 
substitution list 32W2 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W2, the length of the 
substitution list 32W2 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest F0.1 is performed with the 
current edgenode (block 55), resulting in a treenode labeled w2 being added 
to the current forest F0.1 , under its active node (the treenode labeled C1). 
Because the edgenode 30W2 is a terminal node, its corresponding treenode 
labeled w2 is also a terminal node, thus the treenode labeled C1 remains the 
active node in the current forest F0.1 . 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30W2 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W2 (block 70). Because it is the only edgenode in the substitution list 
32W2, it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it has a sibling to its immediate right, the 
next edgenode is set to this sibling, that is, to edgenode 30W3 (block 78). 
Because the next edgenode is not NULL (block 60), the current edgenode is 
set to the next edgenode 30W3 (block 61). 
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Next, because the current edgenode 30W3 is the first edgenode in its 
substitution list 32W3 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W3, the length of the 
substitution list 32W3 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest F0.1 is performed with the 
current edgenode (block 55), resulting in a treenode labeled w3 being added 
to the current forest F0.1 , under its active node (the treenode labeled C1). 
Because the edgenode 30W3 is a terminal node, its corresponding treenode 
labeled w3 is also a terminal node, thus the treenode labeled C1 remains the 
active node in the current forest F0.1 . 

The next edgenode is now obtained (block 59) as shown in FIG- 3A. 
Because the current edgenode 30W3 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W3 (block 70). Because it is the only edgenode in the substitution list 
32W3, it is necessarily the last one, so it is tested to see if it has a sibling to its 
immediate right (block 76). Because it has a sibling to its immediate right, the 
next edgenode is set to this sibling, that is, to edgenode 30E (block 78). 
Because the next edgenode is not NULL (block 60), the current edgenode is 
set to the next edgenode 30E (block 61). 

Next, because the current edgenode 30E is the first edgenode in its 
substitution list 32E (block 52), N is set to the length of the substitution list for 
the current edgenode (block 53). For edgenode 30E, the length of the 
substitution list 32E is equal to 1 . Because N is not greater than 1 (block 54) 
a scalar update of the current forest F0.1 is performed with the current 
edgenode (block 55), resulting in a treenode labeled E being added to the 
current forest F0.1 , under its active node (the treenode labeled C1). Because 
the edgenode 30E is a nonterminal node, its corresponding treenode labeled 
E is also a nonterminal node, thus the nonterminal treenode labeled E 
becomes the new active node in the current forest F0.1 . 

The next edgenode is then obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30E has at least one child (block 66), the next 
edgenode is set to the leftmost child of the current edgenode (block 68), that 
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is, to edgenode 30W4. Because the next edgenode is not NULL (block 60), 
the current edgenode is set to the next edgenode 30W4 (block 61). 

Next, because the current edgenode 30W4 is the first edgenode in its 
substitution list 32W4 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W4, the length of the 
substitution list 32W4 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest F0.1 is performed with the 
current edgenode (block 55), resulting in a treenode labeled w4 being added 
to the current forest F0.1 , under its active node (the treenode labeled E). 
Because the edgenode 30W4 is a terminal node, its corresponding treenode 
labeled w4 is also a terminal node, thus the treenode labeled E remains the 
active node in the current forest F0.1 . 

The next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because the current edgenode 30W4 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W4 (block 70). Because it is the only edgenode in the substitution list 
32W4, it is necessarily the last one, so it is tested to see if it has a sibling to its ; 
immediate right (block 76). Because it has a sibling to its immediate right, the 
next edgenode is set to this sibling, that is, to edgenode 30W6 (block 78). 
Because the next edgenode is not NULL (block 60), the current edgenode is 
set to the next edgenode 30W6 (block 61 ). 

Next, because the current edgenode 30W6 is the first edgenode in its 
substitution list 32W6 (block 52), N is set to the length of the substitution list 
for the current edgenode (block 53). For edgenode 30W6, the length of the 
substitution list 32W6 is equal to 1 . Because N is not greater than 1 
(block 54) a scalar update of the current forest F0.1 is performed with the 
current edgenode (block 55), resulting in a treenode labeled w6 being added 
to the current forest F0.1, under its active node (the treenode labeled E). 
Because the edgenode 30W6 is a terminal node, its corresponding treenode 
labeled w6 is also a terminal node, thus the treenode labeled E remains the 
active node in the current forest F0.1 . 
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e next edgenode is now obtained (block 59) as shown in FIG. 3A. 
Because tnfescurrent edgenode 30W6 does not have at least one child 
(block 66), it is tested to see if it is the last edgenode in the substitution list 
32W6 (block 70). Befcmjse it is the only edgenode in the substitution list 
5 32W6 should be insertea&s shown, it is necessarily the last one, so it is 

tested to see if it has a siblingtqits immediate right (block 76). Because it 
does not have a sibling to its immediate right, the node closure operation is 
applied to the current forest F0.1 , resulting in the parent treenode of the 
treenode labeled E (i.e., the treenode labeU^CI) becoming the new active 
10 node of the current forest F0.1 (block 77). The current edgenode is then 

a -=j tested to see if it has a parent (block 82). Because the^current edgenode 

IJ? 30W6 has a parent (edgenode 30E), the current edgenod^feiecomes 

edgenode 30E (block 86). Because the current forest is F0.1 , nbt^R) 
(block 87), it is changed to the forest of the parent of the current^dgen&de 
tS (bJock- 88), that i s , to F0.1 (th u s r e ma i rn ngrrnchaiTge^ 

The new current edgenode 30E is now tested to see if it is the last 
P edgenode in the substitution list 32E (block 70). Because it is the only 

in edgenode in the substitution list 32E, it is necessarily the last one, so it is 

tested to see if it has a sibling to its immediate right (block 76). Because it 
20 does not have a sibling to its immediate right, the node closure operation is 

applied to the current forest F0.1 , resulting in the parent treenode of the 
treenode labeled C1 (i.e., the treenode labeled A) becoming the new active 
node of the current forest F0.1 (block 77). The current edgenode is then 
tested to see if it has a parent (block 82). Because the current edgenode 30E 
25 has a parent (edgenode 30C1), the current edgenode becomes edgenode 

30C1 (block 86). Because the current forest is F0.1 , not F0 (block 87), it is 
changed to the forest of the parent of the current edgenode (block 88), that is, 
to F0. 

The new current edgenode 30C1 is now tested to see if it is the last 
30 edgenode in the substitution list 32C (block 70). Because it is the last 

edgenode in the substitution list 32C, it is tested to see if it has a sibling to its 
immediate right (block 76). Because it does not have a sibling to its 
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immediate right, the node closure operation is applied to the current forest FO. 
Because the forest FO contains the forests F0.1 and FO.O (and the forest FO.O 
itself contains the forests FO.0.0 and FO.0.1), the node closure operation is 
applied to each of these contained forests. Since the active node in forest 
F0.1 is the treenode labeled C1 , it is closed and the new active node in this 
forest becomes the treenode labeled A. Similarly, since the active node in 
forest FO.0.0 is the treenode labeled CO, it is closed and the new active node 
in this forest becomes the treenode labeled A. Finally, since the active node 
in forest FO.0.1 is the treenode labeled CO, it is closed and the new active 
node in this forest becomes the treenode labeled A. The current edgenode is 
then tested to see if it has a parent (block 82). Because the current edgenode 
30C1 has a parent (edgenode 30A), the current edgenode becomes 
edgenode 30A (block 86). Because the current forest is FO (block 87) it 
remains unchanged. 

The new current edgenode 30A is now tested to see if it is the last 
edgenode in the substitution list 32A (block 70). Because it is the only 
edgenode in the substitution list 32A, it is necessarily the last one, so it is 
tested to see if it has a sibling to its immediate right (block 76). Because it 
does not have a sibling to its immediate right, the node closure operation is 
applied to the current forest FO. Because the forest FO contains the forests 
F0.1 and FO.O (and the forest FO.O itself contains the forests FO.0.0 and 
FO.0.1), the node closure operation is applied to each of these contained 
forests. Since the active node in each of these forests is the treenode labeled 
A (which is the root treenode of each foi^§t)rtRis7esuTts in each forest being 
completed. Because the current edgenode 30A is the root edgenode of the 
packed representation, it does not have a parent, so the next edgenode is set 
to NULL (block 84). 

Because the next edgenode is NULL (block 60), the unpacking module 
is finished processing (block 62). 

Referring to FIG. 7, the unpacked forest 110 contains each of the three 
parses, including the most probable parse (36), alternative parse 1 (37) and 
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alternative parse 2 (38). The parse trees in the unpacked forest object 22 are 
readable by conventional semantic interpreters. 

As discussed above, although the most probable parse is guaranteed 
to be the first one returned by this algorithm, the relative order of the 
remaining parses must be determined by reference to their relative weights. 
This is accomplished by merging the parse trees contained in a particular 
Forest object into a frequency-sorted list that can be returned by a 
Forest-class accessor. This list is a static data member of the Forest class, 
into which a pointer to each Tree is inserted upon Tree completion (thus 
obviating the need for an additional traversal of the Forest to obtain these 
pointers). 

While the embodiments of the invention disclosed herein are presently 
considered to be preferred, various changes and modifications can be made 
without departing from the spirit and scope of the invention. The scope of the 
invention is indicated in the appended claims, and all changes that come 
within the meaning and range of equivalents are intended to be embraced 
therein. 



